Reduced complexity transform-domain adaptive filter using selective partial updates

ABSTRACT

A transform-domain adaptive filter uses selective partial updating of adaptive filter parameters. This updating may be based on a constrained minimization problem. The adaptive filter parameters are separated into subsets, and a subset is selected to be updated at each iteration. A normalization process applied to the frequency bins prior to multiplication by the adaptive filter parameters is used to prevent adaptive filter lock-up that may be experienced in the event of high energy levels of signals in particular frequency bins. Convergence of the transform domain filter is ensured at a rate generally faster than a corresponding time-domain adaptive filter. The transform-domain adaptive filter may be used for various applications, including system identification, channel equalization, or echo cancellation.

RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 60/350,489, filed Nov. 13, 2001 and U.S. Provisional Application No. 60/396,298, filed Jul. 16, 2002. The entire teachings of the above applications are incorporated herein by reference.

BACKGROUND OF THE INVENTION

Adaptive filters can be used in various systems, but require powerful processors if time critical operation is needed. An example of such an application is an echo canceller in a telecommunications system.

FIG. 1 is a schematic diagram of an example prior art telecommunications system 100 a in which an echo canceller 120 employs an adaptive filter. The telecommunications system 100 a includes a near-end telephone 105 a and a far-end telephone 105 b. The near-end telephone 105 a is connected to a near-end hybrid 110 a via a telephone line 125 a. The far-end telephone 105 b is connected to a far-end hybrid 110 b via a telephone line 125 b. The near-end hybrid 110 a is connected to the far-end hybrid 110 b via telephone lines 130 and 135 that are part of a public switched telephone network (PSTN) 115. The echo canceller 120 resides in the PSTN to remove an echo signal caused by the far-end hybrid 110 b as a result of hybrid mismatch.

FIG. 2 is a schematic diagram of an alternative example of the prior art telecommunications system 100 b in which the near-end telephone 105 a, near-end hybrid 110 a, and associated telephone line 125 a are replaced with a wireless subsystem 200. The wireless subsystem 200 includes a handset 205 connected to a base station 210 via a wireless link 215. The wireless subsystem 200 also includes a mobile station controller (MSC) 220 connected to the PSTN 115, having the echo canceller 120 with adaptive filter, that transmits voice signals via the telephone lines 130 and 135 to the far-end hybrid 110 b.

FIGS. 3–5 are schematic diagrams of adaptive filters used in various applications.

FIG. 3 provides details of the echo cancellation application in the prior art telecommunications systems of FIGS. 1 and 2. Referring to FIG. 3, in general, the echo canceller 120 determines and cancels echos received from an unknown system 505, where the unknown system 505 includes the set of transmission lines 130, 135 and the far-end hybrid 110 b. Because of a hybrid mismatch, an echo signal 510 travels from the far-end hybrid 110 b via the telephone line 135 back to the echo canceller 120.

The echo canceller 120 includes an adaptive filter 320 a that uses adaptive filter parameters, w_(k), or weights. A summing unit 315 sums the output, y_(k), of the adaptive filter 320 a with a far-end signal, d_(k), which is composed of the echo signal 510, far-end speech, and noise. The output from the summing unit 315 is an error signal, e_(k), which is just the far-end speech and noise if the adaptive filter 320 a in the echo canceller 120 perfectly matches the echo signal 510.

FIG. 4 is an example of the adaptive filter 320 a used in a prior art system identification application 300. The adaptive filter 320 a is used in concert with the summing unit 315 to determine characteristics of an unknown plant 310. If the squared error, |e_(k)|², is minimal, then the adaptive filter 320 a has reached convergence, and the characteristics of the unknown plant 310 are determined from converged parameters in the adaptive filter 320 a.

FIG. 5 is a schematic diagram of a channel equalization application 400 in which the adaptive filter 320 a is used to determine characteristics of a channel 410, such as a speech channel in the PSTN 115 (FIG. 1). A delay unit 405 is used to estimate the delay of the channel 410. If the squared error, |e_(k)|², is minimized, then the adaptive filter 320 a has reached convergence and has removed distortion the channel 410 may have added to a signal traveling in the channel 410. The characteristics of the channel 410 can be determined as a function of the adaptive filter parameters, w_(k), in the adaptive filter 320 a.

In channel equalization applications, slow convergence of adaptive filters can result from an existence of deep nulls in the channel frequency response. This invariably leads to a large eigenspread and a requirement for long channel equalizers in the case of baud-rate sampling. In acoustic echo cancellation applications, the echo path is estimated by an adaptive filter. The acoustic echo path often requires a large number of parameters for adequate modeling. Speech signals also present a particularly challenging case because they tend to be non-stationary and to have lowpass spectral features.

SUMMARY OF THE INVENTION

Computational complexity and convergence speed are two important considerations for stochastic gradient descent adaptive filtering algorithms. The computational complexity of the least-mean-square (LMS) algorithm for an N-tap adaptive filter is O(N). The convergence rate of LMS is dependent on the eigenspread or the condition number (the ratio of the largest eigenvalue to the smallest eigenvalue) of the autocorrelation matrix of the input signal. Difficult adaptive filtering problems present long adaptive filters (i.e., large N) and input signals with large eigenspread. The resulting adaptive filter tends to have large computational complexity and slow convergence characteristics.

A remedy for slow convergence is to whiten the adaptive filter input signal by using an orthogonal transformation and power-normalization. This is often referred to as transform-domain adaptive filtering. While the use of orthogonal transformation and power-normalization speeds up the algorithm convergence by reducing the eigenspread of the adaptive filter input signal, it leads to no reduction in the number of filter parameters.

Before discussing transform-domain adaptive filtering, a discussion of time domain adaptive filtering using selective partial updates will be discussed in reference to FIGS. 6 and 7.

FIG. 6 is a detailed schematic diagram of the adaptive filter 320 a implemented in a time domain structure. The time domain adaptive filter 320 a is composed of filter parameters 600, where the filter parameters 600 include delay elements 605 and weights 610 with which the input signal, x_(k), is multiplied, as shown. The outputs from the multipliers 610 are added together by a summing unit 615. The summed output, y_(k), from the summing unit 615 is used to determine the error between the input, x_(k), and an output from a system or channel being analyzed, as described above in reference to FIGS. 3–5.

The adaptive filter 320 a of FIG. 6 is an example of a time domain adaptive filter, discussed in a paper by K. Dogancay and O. Tanrikulu, “Selective-Partial-Update NLMS and Affine Projection Algorithms for Acoustic Echo Cancellation,” in Proc. of IEEE Int'I Conf. on Acoustics Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 2000, vol. 1, pp. 448–451. When used to model the unknown plant 310 (FIG. 4), for example, the filter may become too long (e.g., filter parameters ranging in the hundreds or thousands), which may be too costly in terms of processing time or computer memory.

In terms of processing time, the problem with the time domain adaptive filter structure 320 a is that the signals at each multiplier 610 are highly correlated, since each such signal is one time sample away from the previous and next signal at the previous and next multiplier, respectively. This degree of correlation causes each coefficient to be correlated and interdependent with the other coefficients. So, the slowest converging coefficient controls the convergence of the entire filter.

In the event the filter 320 a is a long filter but there are not enough computing resources (i.e., computing time, memory, etc.) to adequately support the long filter, then one way to save on computing resources is to select a subset of filter parameters to update rather than trying to update all filter parameters at each filtering iteration.

FIG. 7 is an example of the time domain adaptive filter 320 a that has its filter parameters separated into two different subsets of filter parameters: subset A 705 a and subset B 705 b. In this example, the subsets 705 a, 705 b include two subsets of three parameters (i.e., delays 605 and weights 610). Alternatively, the subsets 705 a, 705 b may include (i) three subsets of two parameters, (ii) one subset of two parameters and one subset of four parameters, or (iii) six subsets of one parameter. The tradeoff is that the finer the granulation, the higher the need for computing resources for subset selection.

Some prior art adaptive filters have employed heuristic techniques to determine which subset of filter parameters is to be updated at each iteration. In the above-referenced paper published by these same inventors, a closed-form solution was presented for determining which subset of filter parameters to update if a processor, on which the adaptive filter is executed, does not have enough computing resources to change all of the filter parameters each iteration.

A full derivation of the closed-form solution can be found in K. Do{hacek over (g)}ancay and O. Tanrikulu, “Selective Partial Update NMLS and Affine Projection Algorithms for Acoustic Echo Cancellation,” in Proc. of IEEE Int. Conf on Acoustics, Speech, and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 2000, vol. 1, pp. 448–451. A subset of the derivation is repeated below.

The closed-form solution can be solved by starting with a constrained optimization equation:

${\min\limits_{w_{k + 1}}{\left\{ {{w_{k + 1} - w_{k}}}_{2}^{2} \right\}{s \cdot t \cdot x_{k}^{T} \cdot w_{k + 1}}}} = d_{k}$

In this equation, a cost function is solved by minimizing the difference between a next-coefficient, ω_(k+1), and the current coefficient w_(k) such that the input signal multiplied by the next coefficient is equal to the echo signal, d_(k). The constraining terms are referred to as a posteriori output since they look at the next coefficient rather than the previous coefficient, which would be used in an a priori output scenario.

Solving the above equation, it should be noted that in this time domain closed-form solution, all of the coefficients are updated and, if the error is zero, the next coefficient is equal to the present coefficient:

$w_{k + 1} = {w_{k} + {\frac{1}{{x_{k}}_{2}^{2}}{e_{k} \cdot x_{k}}}}$

So, to reduce the computation time of having to update every coefficient at every iteration, a second minimizing function is applied. In this second minimizing function, the index ‘i’ refers to the subsets of parameters, and the subset of parameters selected to be updated is that which is determined to be the minimum of the following equation:

${\begin{matrix} \min \\ {1 < i < M} \end{matrix}\left\{ \begin{matrix} \min \\ {w_{i}\left( {k + 1} \right)} \end{matrix}||{{w_{i}\left( {k + 1} \right)} - {w_{i}(k)}}||_{2}^{2} \right\}{s \cdot t \cdot {w^{T}\left( {k + 1} \right)}}x_{k}} = d_{k}$

Similarly, the parameter subset index is applied to the following equation:

${w_{{k + 1},i} = {w_{k,i} + {\frac{1}{{x_{k,i}}_{2}^{2}}e_{k}x_{k,i}}}},$

which can be solved as:

$\left( {w_{{k + 1},i} - w_{k,i}} \right) = {\frac{1}{{x_{k,i}}_{2}^{2}}e_{k}{x_{k,i}.}}$

Thus, the parameter subset to be updated can be determined by solving the following equation:

$i =_{j}^{argmin}\left( {\frac{e_{k}x_{k,j}}{{x_{k,j}}_{2}^{2}}}_{2}^{2} \right)$

which reduces to:

i=_(j)^(argmax)x_(k, j )₂².

So, according to the closed-form solution, the parameter subset to update is the one that gives the maximum sum of squares, which leads to faster convergence of the filter than updating a random subset or sequential updating.

Although a level of improvement is found for the time-domain adaptive filter of FIG. 7 by selecting a given subset of adaptive filter parameters to update at each iteration rather than updating all parameters, the time-domain adaptive filter is inherently slow to converge due to the correlation between signals at each multiplier. Thus, convergence of the overall time domain adaptive filter is only as fast as the slowest converging adaptive filter parameter.

In contrast, a transform-domain or frequency-domain adaptive filter operates in parallel in that the adaptive filter parameters are applied to signals in different frequency bins in parallel. Because there is not the same level of correlation between signals at each multiplier, the transform-domain adaptive filter converges faster than the time-domain adaptive filter, albeit at the expense of a more complex filter structure.

To achieve further improvement in performance by a transform-domain adaptive filter, similar to the time-domain adaptive filter approaches discussed above, the number of transform-domain filter parameters adapted at each iteration can also be reduced by use of selective partial updating of the adaptive filter parameters. This updating may be based on a constrained minimization problem, optionally including calculating a squared-Euclidean-norm. The result of this approach is to segment the adaptive filter parameters into subsets and choose a subset, or block, to update at each iteration. A normalization process applied to at least two frequency bins prior to multiplication by the adaptive filter parameters is used to prevent an adaptive filter “lock-up” that may be experienced in the event of high energy levels of signals in particular frequency bins. Thus, convergence of the transform domain filter is ensured at a rate generally faster than the convergence rate of a corresponding time-domain adaptive filter.

This transform-domain adaptive filter may be used for various applications, including system identification, channel equalization, or echo cancellation.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.

FIG. 1 is a schematic diagram of a telecommunications system having an echo canceller with an adaptive filter;

FIG. 2 is a schematic diagram of a telecommunications system having a wireless subnetwork and using the echo canceller of FIG. 1;

FIG. 3 is a schematic diagram of the adaptive filter of FIG. 1 used in an echo cancelling application;

FIG. 4 is a schematic diagram of the adaptive filter of FIG. 1 used in a system identification application;

FIG. 5 is a schematic diagram of the adaptive filter of FIG. 1 used in a channel equalization application;

FIG. 6 is a schematic diagram of the adaptive filter of FIG. 1 implemented in a time-domain structure;

FIG. 7 is the schematic diagram of FIG. 6 in which the adaptive filter has parameters organized in subsets;

FIG. 8A is a schematic diagram of an inventive transform-domain adaptive filter that may be employed in the applications of FIGS. 3–5;

FIG. 8B is a block diagram of the adaptive filter of FIG. 8A;

FIG. 9 is a plot of energy versus time for a signal in a frequency bin in the adaptive filter of FIG. 8A;

FIG. 10 is an energy plot for the four frequency bins in the adaptive filter of FIG. 8A;

FIG. 11 is a flow diagram of a process executed by the adaptive filter of FIG. 8A;

FIG. 12 is a graph of convergence curves for full-update TD-LMS, SPU-TD-MLS of FIG. 8A, and NLMS for W=3.5; and

FIG. 13 is a graph of the convergence curves of FIG. 12 for W=3.8 to illustrate slower convergence.

DETAILED DESCRIPTION OF THE INVENTION

A description of preferred embodiments of the invention follows.

FIG. 8A is a schematic diagram of the inventive transform-domain adaptive filter 800. The adaptive filter 800 is a transform away from the time-domain adaptive filter described above in reference to FIG. 7. Although transform domain filters require more memory and are more complex, they have faster convergence than time-domain adaptive filters, even though time-domain adaptive filters are simpler in implementation. The reason for the faster convergence is that the signals being processed are separated into frequency bins, and the signals in each of the frequency bins are not correlated as in the case of the time-domain adaptive filter, as discussed above. A unique problem with the transform-domain adaptive filter not found in the case of time domain adaptive filter is referred to as “lock-up,” which was learned during the design and implementation of this transform-domain adaptive filter 800.

In teleconferencing equipment on the user's side of a telecommunications network, there is typically only one line. Higher cost for inefficient echo cancellation processing, which drives the cost of the equipment higher than would a more efficient echo cancelling processing model, is absorbed by the end user. However, on the network side where network service providers absorb the cost of a large number of lines, inefficient echo cancellation processing is more important for cost and speed of service reasons. Echo cancellers having the inventive transform-domain adaptive filter 800 can support both end users and network service providers and can reduce the costs for the network service providers resulting from efficiency and convergence speed.

Referring to FIG. 8A, the transform-domain adaptive filter 800 receives a time-domain input signal, x_(k), at an orthogonal transform unit 805. The orthogonal transform unit 805 performs a time-to-frequency domain transform to provide signals in distinct frequency bins for further processing in the frequency domain. Candidate orthogonal transforms are the Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), Discrete Hartley Transform (DHT), and Walsh-Hadamard Transform (WHT), to name a few. It should be understood that non-orthogonal transforms may also be employed.

In this particular embodiment, for N=4, the orthogonal transform unit 805 outputs frequency signals in four different frequency bins, referred to as: v_(k,0), v_(k,1), v_(k,2), and v_(k,3).

The four frequency bin signals have instantaneous energy levels and are normalized by a normalizing processor 1000. In this particular embodiment, the normalizing processor 1000 includes four multipliers 1005, which multiply the respective signals in each of the frequency bins by one-divided-by a respective, square root of average, energy level (1/σ), where σ is the square root of the average energy level.

FIG. 9 provides a time plot having the instantaneous energy levels of the zero'th frequency bin signal, v_(k,0), and the average energy level of that signal over time, σ²v_(k,0). An example of an equation that can be used to calculate the average energy level is: σ_(i) ²(k)=ασ_(i) ²(k−1)+(1−α)|v_(k,i)|², where α=0.95.

This equation includes a scale factor, α, that is used to balance the average energy of the frequency bin signal as a function of the previous average energy level and the present instantaneous energy level of the signal.

The normalizing unit 1000 prevents the adaptive filter 800 from going into a lock-up mode. The cause for the lock-up mode can be understood from an energy diagram in FIG. 10.

Referring to FIG. 10, four energy levels, one corresponding to each frequency bin 0, 1, 2, 3, are shown as instantaneous energy vectors 915. The zero'th and first energy levels are much higher than the second and third energy levels. Because of this disparity, without the normalizing unit 1000, the zero'th and first adaptive filter parameters would be adjusted at each iteration while the second and third adaptive filters would not be adjusted, which is defined as a lock-up mode. Thus, the adaptive filter 800 would not achieve convergence without the normalizing unit 1000 since the second and third adaptive filter parameters would not reach convergence.

Referring again to FIG. 8A, following the normalizing unit 1000, the adaptive filter 800 has a set of multipliers 1010 that multiply the normalized signals in the frequency bins, q_(k,0), q_(k,1), q_(k,2), . . . , q_(k,N−1), by respective adaptive filter parameters, z₀(k), z₁(k), z₂(k), . . . , z_(N−1)(k).

To minimize need for computing resources, not all of the adaptive filter parameters are updated each iteration. In support of this minimization, the frequency domain adaptive filter parameters 320 b are separated into subsets A 905 and B 910, where, in this case, each subset 905 and 910 includes two adaptive filter parameters.

In this particular embodiment, the adaptive filter 800 selects which subset of adaptive filter parameters to update by solving a constrained minimization equation. The constrained minimization equation may include calculating a squared-Euclidean-norm of the normalized signals in the frequency bins.

After the normalized signals q(k) have been combined with the adaptive filter parameters, z,(k), the weighted signals of the frequency bins are summed together by a summing unit 615. The output from the summing unit 615 is referred to as the output signal, y_(k). This output signal, y_(k), is subtracted from an external signal, d_(k), by a summing unit 315.

The sum of the external signal, d_(k), and a negative value of the output signal, y_(k), results in an error signal, e_(k), which is fed back to the adaptive filter 800 via a feedback path 810 to allow the adaptive filter 800 to adjust its filter parameters, z(k). When the squared error, |e_(k)|², has reached a minimum, the adaptive filter 800 has converged to a solution.

The converged adaptive filter 800 provides information based on the application in which it is employed. For example, the adaptive filter parameters can be used to cancel an echo signal in a signal in a telecommunications application, identify a system in a systems identification application, or determine the characteristics of a channel in a channel equalization application. In the echo signal cancellation application, when the adaptive filter 800 has converged to a solution, the echo signal in the time domain signal related to the frequency domain signals has been minimized.

FIG. 11 is a flow diagram of a process 1100 executed by the inventive adaptive filter 800 of FIG. 8A. The process 1100 begins in step 1105 after receiving the frequency domain signals. In the embodiment of the adaptive filter 800 of FIG. 8A, an N×N orthogonal transform 805 provides the frequency domain signals v_(k,0), v_(k,1), and so forth. In step 1110, the process 1100 normalizes the frequency domain signals in at least two frequency bins. In the adaptive filter 800 of FIG. 8A, there may be four frequency bins that are normalized.

In step 1115, the process 1100 adaptively filters the normalized signals, q_(k,0), q_(k,1), . . . , q_(k,N−1). In step 1120, the process 1100 updates a subset of filter parameters. The process 1100 repeats with the updated subset of filter parameters among the other subset(s) of filter parameters.

FIG. 8B is a block diagram of the adaptive filter 800 of FIG. 8A. The adaptive filter parameters 320 b and mathematical processing units 1010 are indicated as separate blocks. An update unit 815 is defined as a separate block for completeness but should be understood as being inherently part of the adaptive filter 800 in FIG. 8A. The update unit 800 may have access to all of the data received from the normalizers 1000, adaptive parameters 320 b, output y_(k) from the mathematical processing units 1010, and error e_(k). The update unit 800 uses a subset of the data to determine “next” values of a subset of the adaptive filter parameters 320 b. The update unit 815 may use the equations discussed herein to determine the “next” values.

It should be understood that the adaptive filter 800 may be implemented in software and executed using hardware, for example, a digital signal processor (not shown). The block diagram of FIG. 8B and schematic diagram of FIG. 8A are merely representations of the software and hardware and in no way intended to be limiting. Alternative representations and descriptors may be used to show and describe the processing. Alternative forms of hardware and software may be used to implement the adaptive filter, such as parallel processors, general purpose computers, matrix processors, distributed processors, and software operating thereon, respectively.

The following discussion provides mathematical details for the inventive transform-domain adaptive filter 800 discussed above.

For an adaptive filter of length N with regressor vector x(k)=[x(k), x(k−1), . . . , x(k−N+1)]^(T) and parameter vector ω(k)=[ω₀(k), ω₁(k), . . . , ω_(N−1)(k)]^(T), the LMS algorithm is given by the recursion ω(k+1)=ω(k)+μe(k)x(k)  (1) where μ is a positive stepsize, which controls the speed of convergence, and e(k) is the error signal given by e(k)=d(k)−y(k). Here, d(k) is the desired filter response, and y(k) is the filter output y(k)=w^(T)(k)x(k). The convergence speed of the LMS algorithm in equation (1) is sensitive to the condition number of the autocorrelation matrix of x(k). If the condition number is large, LMS is expected to have a slow rate of convergence. Whitening (i.e., decorrelating) the input signal can be considered as a remedy for this problem by reducing the eigenspread.

Transforming the regressor vector x, using an N×N orthogonal transformation, denoted T, a transform vector is produced, where: v(k)=[v_(k,0), v_(k,1) , . . . , v _(k,N−1)]^(T) =Tx(k).

The matrix T is a unitary matrix (i.e., T^(H)T=TT^(H)=I ). Candidate orthogonal transforms are the discrete Fourier transform (DFT), discrete cosine transform (DCT), discrete Hartley transform (DHT), and Walsh-Hadamard transform (WHT), to name but a few. The entries of the transformed vector v(k) are (approximately) uncorrelated with variances σ_(i) ²=E{|v_(k,i)|²}, i=0, 1, . . . , N−1. If σ_(i) ² were known a priori, the entries of v(k) could be normalized to yield entries with unit variance: q(k)=Λ⁻¹ v(k) where Λ is a diagonal autocorrelation matrix defined by

$\Lambda = {\begin{bmatrix} \sigma_{0} & \; & \; & 0 \\ \; & \sigma_{1} & \; & \; \\ \; & \; & ⋰ & \; \\ 0 & \; & \; & \sigma_{N - 1} \end{bmatrix}.}$

The vector q(k) with unit variance (i.e., approximately uncorrelated entries) is applied to an adaptive filter with parameters z(k)=[z₀(k),z₁(k), . . . , z_(N−1)(k)]^(T). See W. K. Jenkins, A. W. Hull, J. C. Straight, B. A. Schnaufer, and X. Li, Advanced Concepts in Adaptive Signal Processing, Kluwer, Boston, 1996. FIG. 8A, discussed above, depicts the resulting transform-domain adaptive filter. The filter parameters z(k) can be adapted by using the LMS algorithm: z(k+1)=z(k)+μe(k)q*(k) where * denotes complex conjugate and e(k)=d(k)−z ^(T)(k)q(k). The power-normalization can be absorbed into the adaptive filter parameters using w(k)=Λ⁻¹ z(k)  (2) which yields w(k+1)=w(k)+μe(k)Λ⁻¹ q*(k) or w(k+1)=w(k)+μe(k)Λ⁻² v*(k).  (3) This is the transform-domain LMS (TD-LMS) algorithm. The matrix Λ² is not readily available and needs to be estimated on-line. This can be done by using a sliding exponential window: σ_(i) ²(k)=ασ_(i) ²(k−1)+(1−α)|v _(k,i)|² , i=0, 1, . . . , N−1 where 0<α<1. Use of Selective-Partial-Updates:

The notion of selective-partial-updating stems from a desire to reduce the number of multiplications in the update term by adapting only a small number of the adaptive filter parameters. The regressor vector q(k) and the parameter vector z(k) can be partitioned into M blocks of length L=N/M, where L is an integer: q(k)=[q ₁ ^(T)(k)q ₂ ^(T)(k) . . . q _(M) ^(T)(k)]^(T) z(k)=[z ₁ ^(T)(k)z ₂ ^(T)(k) . . . z _(M) ^(T)(k)]^(T). Here, the parameter vector blocks z₁(k), z₂(k), . . . , z_(M)(k) represent candidate subsets of z(k) that can be updated at time instant k. For a single-block update, the following constrained minimization problem can be defined, which is solved by the NLMS algorithm:

$\begin{matrix} {{{{\overset{\min}{1 \leq i \leq M}\mspace{34mu}\overset{\min}{\left. {z_{i}\left( {k + 1} \right)} \right.||}{z_{i}\left( {k + 1} \right)}} - {z_{i}(k)}}\mathop{\text{||}}_{2}^{2}{{subject}\mspace{14mu}{to}\mspace{11mu}{z^{T}\left( {k + 1} \right)}{q(k)}}} = {d(k)}} & (4) \end{matrix}$ Consider first the minimization problem for a given block. If i is fixed, equation (4) reduces to

$\begin{matrix} {{{{\overset{\min}{\left. {z_{i}\left( {k + 1} \right)} \right.||}{z_{i}\left( {k + 1} \right)}} - {z_{i}(k)}}\mathop{\text{||}}_{2}^{2}{{subject}\mspace{14mu}{to}\mspace{11mu}{z^{T}\left( {k + 1} \right)}{q(k)}}} = {d(k)}} & (5) \end{matrix}$ which can be solved by using the method of Lagrange multipliers. See G. C. Goodwin and K. S. Sin, Adaptive Filtering, Prediction, and Control, Prentice Hall, New Jersey, 1984. The cost function to be minimized is J _(i)(k)=∥z _(i)(k+1)−z _(i)(k)∥₂ ²+λ(d(k)−z^(T)(k+1)q(k)) where λ is a Lagrange multiplier. Setting ∂J_(i)(k)/∂z_(i)(k+1)=0 and ∂J_(i)(k)/∂λ=0, we get

$\begin{matrix} {{{z_{i}\left( {k + 1} \right)} - {z_{i}(k)} - {\frac{\lambda}{2}{q_{i}^{*}(k)}}} = 0} & \text{(6a)} \\ {{{d(k)} - \underset{\underset{{z^{T}{({k + 1})}}{q{(k)}}}{︸}}{\left( {{{z_{i}^{T}\left( {k + 1} \right)}{q_{i}(k)}} + {{{\overset{\_}{z}}_{i}^{T}\left( {k + 1} \right)}{{\overset{\_}{q}}_{i}^{T}(k)}}} \right)}} = 0} & \text{(6b)} \end{matrix}$ where {overscore (q)}_(i)(k) is obtained from q(k) by deleting q_(i)(k), and {overscore (z)}_(i)(k+1) is defined likewise. Substituting (6a) into (6b), we get

$\begin{matrix} {\frac{\lambda}{2} = \frac{{d(k)} - {{z_{i}^{T}(k)}{q_{i}(k)}} - {{{\overset{\_}{z}}_{i}^{T}\left( {k + 1} \right)}{{\overset{\_}{q}}_{i}(k)}}}{{q_{i}^{H}(k)}{q_{i}(k)}}} & \text{(7a)} \\ {= \frac{{d(k)} - {{z^{T}(k)}{q(k)}}}{{{q_{i}(k)}}_{2}^{2}}} & \text{(7b)} \\ {= \frac{e(k)}{{{q_{i}(k)}}_{2}^{2}}} & \text{(7c)} \end{matrix}$ where we have used {overscore (z)}_(i)(k+1)={overscore (z)}_(i)(k). After substituting (7c) into (6a) and introducing a small positive stepsize μ, we obtain the following recursion to solve the fixed-block-update constrained minimization problem in (5):

$\begin{matrix} {{z_{i}\left( {k + 1} \right)} = {{z_{i}(k)} + {\frac{\mu}{{{q_{i}(k)}}_{2}^{2}}{q_{i}^{*}(k)}{{e(k)}.}}}} & (8) \end{matrix}$

The selection of a block to be updated should be made by determining the block with the smallest squared-Euclidean-norm update (see equation (4)). Thus, the block to be updated at time instant k is given by

$\begin{matrix} {i = {{\underset{1 \leq j \leq M}{argmin}{{{z_{j}\left( {k + 1} \right)} - {z_{j}(k)}}}_{2}^{2}} = {{\underset{1 \leq j \leq M}{argmin}{\frac{{q_{j}^{*}(k)}{e(k)}}{{{q_{j}(k)}}_{2}^{2}}}_{2}^{2}} = {{\underset{1 \leq j \leq M}{argmin}\frac{1}{{{q_{j}(k)}}_{2}^{2}}} = {\underset{1 \leq j \leq M}{argmin}{\frac{1}{{{q_{j}(k)}}_{2}^{2}}.}}}}}} & (9) \end{matrix}$ Equation (8) can be modified to incorporate the selection criterion (9). This yields the following selective-single-block-update NLMS algorithm:

$\begin{matrix} {{{z_{i}\left( {k + 1} \right)} = {{z_{i}(k)} + {\frac{\mu}{{{q_{i}(k)}}_{2}^{2}}{q_{i}^{*}(k)}{e(k)}}}}{i = {\underset{1 \leq j \leq M}{argmin}{{{q_{j}(k)}}_{2}^{2}.}}}} & (10) \end{matrix}$

For white inputs, the NLMS algorithm does not offer any advantages over the LMS algorithm. Thus, one can drop the stepsize normalization to obtain the LMS version:

$\begin{matrix} {{z_{i}\left( {k + 1} \right)} = {{{z_{i}(k)} + {{{\mu q}_{i}^{*}(k)}{e(k)}\mspace{14mu} i}} = {\underset{1 \leq j \leq M}{argmin}{{{q_{j}(k)}}_{2}^{2}.}}}} & (11) \end{matrix}$ Using (2) in (11), we further obtain

$\begin{matrix} {{w_{i}\left( {k + 1} \right)} = {{{w_{i}(k)} + {\mu\;\Lambda_{i}^{- 2}{v_{i}^{*}(k)}{e(k)}\mspace{14mu} i}} = {\underset{1 \leq j \leq M}{argmin}{{q_{j}(k)}}_{2}^{2}}}} & (12) \end{matrix}$ which is the transform-domain selective-partial-update LMS (TD-SPU-LMS) algorithm. In equation (12), Λ_(i) is an L×L diagonal matrix defined through

$\Lambda = {\begin{bmatrix} \Lambda_{1} & \; & \; & \; \\ \; & \Lambda_{2} & \; & \; \\ \; & \; & ⋰ & \; \\ \; & \; & \; & \Lambda_{M} \end{bmatrix}.}$

The selective-partial-update algorithm in equation (12) can be generalized to multiple-block selection: w _(I) _(B) (k+1)=w _(I) _(B) (k)+μΛ_(I) _(B) ⁻² v _(I) _(B) ^(*)(k)e(k) where I_(B)={i:∥q_(i)(k)∥₂ ² is one of the B largest in ∥q₁(k)∥₂ ², . . . , ∥q_(M)(k)∥₂ ²} where I_(B)={i₁, i₂, . . . , i_(B)} is the set of selected blocks, determining which blocks to be retained in the parameter vector, regressor vector, and the autocorrelation matrix. For example, the vector w_(I) _(B) (k) is defined by w _(I) _(B) (k)=[w _(i) ₁ ^(T)(k), w_(i) ₂ ^(T)(k), . . . , w _(i) _(B) ^(T)(k)]^(T). Note that setting M=N and B=M corresponds to the full-update TD-LMS in equation (3).

Computational Complexity

Selective-partial updating reduces the number of multiplications in the update term. For given B and M values, the computational complexity for the regressor vector in the update term is N B/M or BL (complex) multiplications, compared to N (complex) multiplications. The selection process introduces some overhead. If the Heapsort algorithm (see D. E. Knuth, Sorting and Searching, vol. 3 of The Art of Computer Programming, Addison-Wesley, Reading, Mass., 2^(nd) Edition, 1973) is used, the computational complexity for the selection of B blocks is a maximum of M log₂ B comparisons. Comparing the savings in multiplications,

(M−B)L=N−BL with the number of comparisons, it becomes clear that, for long adaptive filters (large N), the savings in multiplications exceeds the number of comparisons by a large margin. Because the regressor vector q(k) does not have a sliding window structure for successive k, the Sortline algorithm (see I. Pitas, “Fast algorithms for running ordering and max/min Calculation,” IEEE Trans. On Circuits and Systems, vol. 36, no. 6, pp. 795–804, June 1989) is not applicable.

Computation of ∥q_(J)(k)∥₂ ² can be done by means of a look-up table to further reduce the overhead associated with subset selection.

Simulation Examples

The application of the SPU-TD-LMS algorithm is now illustrated with a simple channel equalization problem. The channel input is assumed be a binary sequence taking on values ±1. The channel is a nonminimum-phase linear time-invariant system with a three-tap impulse response obtained from a sampled raised cosine (See S. Haykin, Adaptive Filter Theory, Prentice Hall, N.J., 3^(rd) edition, 1996):

${h_{k} = {\frac{1}{2}\left( {1 + {\cos\left( \frac{2{\pi\left( {k - 1} \right)}}{W} \right)}} \right)}},{k = 0},1,2.$ The equalizer is an adaptive transversal filter with N=12 parameters. The channel output signal is distorted by additive white Gaussian noise with zero mean and variance 0.001. The parameter W controls the condition number of the autocorrelation matrix for the noisy channel output.

The full-update TD-LMS, SPU-TD-LMS and NLMS algorithms were simulated on this channel for two values of W. The length parameters of SPU-TD-LMS were chosen as M=6 and B=1. In other words, ⅙th of the equaliser parameters are adapted at every iteration. DCT was used to form the orthogonal transformation T.

FIG. 12 shows the convergence curves for the three algorithms, averaged over 50 independent realizations, for W=3.5 (yielding a condition number of 48.8804). The stepsizes of the processes were chosen to attain approximately the same MSE after convergence.

FIG. 13 shows the same convergence curves for W=3.8, corresponding to a condition number of 233.4639 (i.e., the channel output signal is very ill-conditioned). Not surprisingly, the NLMS algorithm exhibits a very slow convergence for this channel.

In conclusion, a reduced-complexity TD-LMS process has been developed by employing selective-partial updating in parameter adaptation. A simple selection criterion is derived by drawing on the principle of minimum disturbance. The selection criterion picks the parameter blocks that correspond to the largest squared-Euclidean-norm regressor blocks. The regressor vector is obtained from the transformed time-domain regressor vector after power-normalizing the individual entries. The performance of the resulting reduced algorithm was simulated on a simple raised-cosine channel. The complexity reduction does not seem to slow down the convergence rate unduly. Compared with the NLMS algorithm, the SPU-TD-LMS has a very fast convergence speed.

It should be understood that the teachings provided herein may be implemented in software that can be stored or distributed in various forms of computer readable media, such as RAM, ROM, CD-ROM, magnetic disk, firmware, or other digital media. The software may also be downloaded across a computer network, such as the Internet. The software may be loaded and executed by a general or application-specific processor.

While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims. 

1. A method for performing adaptive filtering, the method comprising: normalizing frequency domain signals in at least two frequency bins; adaptively filtering the normalized frequency domain signals; and updating a given non-null subset of filter coefficients containing fewer than all filter coefficients used in the adaptive filtering, the given subset selected from among a plurality of subsets of filter coefficients used in the adaptive filtering based on a comparison of the given subset with other subsets of filter coefficients in the plurality of subsets at a present filtering iteration, the given subset and non-updated subsets of filter coefficients defining partially updated filter coefficients for adaptively filtering the normalized frequency domain signals at a next filtering iteration.
 2. The method according to claim 1 wherein normalizing the frequency domain signals includes dividing instantaneous energy levels of the frequency domain signals in each frequency bin by respective average energy levels of the frequency domain signals in each of the frequency bins.
 3. The method according to claim 1 wherein updating the given subset of filter coefficients includes solving a constrained minimization equation.
 4. The method according to claim 3 wherein solving the constrained minimization equation includes calculating a squared-Euclidean-norm.
 5. The method according to claim 1 wherein updating the given subset of filter coefficients includes determining which subset to update according to a least mean square (LMS) process.
 6. The method according to claim 1 wherein updating the given subset of filter coefficients occurs at every iteration of the adaptive filtering.
 7. The method according to claim 1 further including predefining the plurality of subsets of filter coefficients.
 8. The method according to claim 1 wherein normalizing the frequency domain signals is done as a function of a previous average energy level and a present instantaneous energy level of the respective normalized frequency domain signals.
 9. The method according to claim 8 wherein normalizing the frequency domain signals includes applying a scale factor to the previous average and present instantaneous energy levels of the respective, normalized, frequency domain signals to minimize a potential for adaptive filtering lock-up.
 10. The method according to claim 1 further including performing a time-to-frequency domain transform to provide the frequency domain signals from an input signal.
 11. The method according to claim 10 wherein the transform is an orthogonal transform.
 12. The method according to claim 1 further including minimizing an echo signal in a time domain signal related to the frequency domain signals.
 13. The method according to claim 12 wherein updating the given subset of filter coefficients includes targeting a burst echo signal in the time domain signal.
 14. The method according to claim 1 used in at least one of the following applications: system identification, channel equalization, or echo cancellation.
 15. An adaptive filter, comprising: normalizers to normalize frequency domain signals in at least two frequency bins; mathematical processing units coupled to the normalizers to combine the normalized frequency domain signals with respective adaptive filter coefficients; and an update unit to update a given non-null subset of filter coefficients containing fewer than all filter coefficients used in the adaptive filtering, the given subset selected from among a plurality of subsets of filter coefficients based on a comparison of the given subset with other subsets of filter coefficients in the plurality of subsets at a present filtering iteration, the given subset and non-updated subsets of filter coefficients defining partially updated filter coefficients that the mathematical processing units combine with the normalized frequency domain signals at a next filtering iteration.
 16. The adaptive filter according to claim 15 wherein the normalizers include mathematical processing units to divide instantaneous energy levels of the frequency domain signals in the frequency bins by respective average energy levels.
 17. The adaptive filter according to claim 15 wherein the update unit determines whether to update the given subset of filter coefficients by solving a constrained minimization equation.
 18. The adaptive filter according to claim 17 wherein the update unit solves the constrained minimization equation by calculating a squared-Euclidan-norm.
 19. The adaptive filter according to claim 15 wherein the update unit includes a least mean square (LMS) processing unit to determine which subset to update.
 20. The adaptive filter according to claim 15 wherein the update unit updates the given subset of filter coefficients at every iteration.
 21. The adaptive filter according to claim 15 wherein the plurality of subsets of filter coefficients are predefined.
 22. The adaptive filter according to claim 15 wherein the adaptive filter updates the given subset of filter coefficients as a function of a previous average energy level and a present instantaneous energy level of the respective normalized frequency domain signals.
 23. The adaptive filter according to claim 22 wherein the adaptive filter updates the given subset of filter coefficients by applying a scale factor to the previous average and present instantaneous energy levels of the respective, normalized, frequency domain signals to minimize a potential for adaptive filter lock-up.
 24. The adaptive filter according to claim 15 further including a time-to-frequency domain transform unit to provide the frequency domain signals from an input signal.
 25. The adaptive filter according to claim 24 wherein the transform unit is an orthogonal transform unit.
 26. The adaptive filter according to claim 15 wherein the mathematical processing units output an echo cancellation signal; and further including a summing unit coupled to the adaptive filter and a far end of a communications system to calculate an error signal by summing (i) a returning far end signal having an echo signal and (ii) the echo cancellation signal.
 27. The adaptive filter according to claim 26 wherein a feedback path couples the update unit to the summing unit to return the error signal to the update unit to allow the update unit to update the given subset of filter coefficients from among the plurality of subsets of filter coefficients to minimize the echo signal in the returning far end signal.
 28. The adaptive filter according to claim 15 wherein the update unit updates the given subset of filter coefficients in a manner that targets a burst echo signal in the far end signal.
 29. The adaptive filter according to claim 15 used in at least one of the following applications: system identification, channel equalization, or echo cancellation.
 30. An adaptive filter, comprising: means for normalizing frequency domain signals in at least two frequency bins; means for adaptively filtering the normalized frequency domain signals; and means for updating a given non-null subset of filter coefficients containing fewer than all filter coefficients used in the adaptive filtering, the given subset selected from among multiple subsets used in the adaptive filtering based on a comparison of the given subset with other subsets of filter coefficients in the multiple subsets at a present filtering iteration, the given subset and non-updated subsets of filter coefficients defining partially updated filter coefficients for adaptively filtering the normalized frequency domain signals at a next filtering iteration.
 31. A computer-readable medium having stored thereon sequences of instructions, the sequences of instructions, when executed by a digital processor, causing the processor to perform the steps of: normalizing frequency domain signals in at least two frequency bins; adaptively filtering the normalized frequency domain signals; and updating a given non-null subset of filter coefficients containing fewer than all filter coefficients used in the adaptive filtering, the given subset selected from among a plurality of subsets of filter coefficients used in the adaptive filtering based on a comparison of the given subset with other subsets of filter coefficients in the plurality of subsets at a present filtering iteration, the given subset and non-updated subset subsets of filter coefficients defining partially updated filter coefficients for adaptively filtering the normalized frequency domain signals at a next filtering iteration. 